Abstract: A low complexity, essentially-ML decoding technique for the Golden code and the 3 antenna Perfect code was introduced by Sirianunpiboon, Howard & Calderbank. Though no theoretical analysis of the decoder was given, the simulations showed that this decoding technique has almost maximum-likelihood (ML) performance. Inspired by this technique, in this paper we introduce two new low complexity decoders for Space-Time Block Codes (STBCs): the Adaptive Conditional Zero-Forcing (ACZF) decoder and the ACZF decoder with successive interference cancellation (ACZF-SIC), which include as a special case the decoding technique of Sirianunpiboon et al. We show that both ACZF and ACZF-SIC decoders are capable of achieving full diversity, and we give sufficient conditions for an STBC to give full diversity with these decoders. We then show that the Golden code, the 3 and 4 antenna Perfect codes, the 3 antenna Threaded Algebraic Space-Time code and the 4 antenna rate 2 code of Srinath & Rajan are all full-diversity ACZF/ACZF-SIC decodable with complexity strictly less than that of their ML decoders. Simulations show that the proposed decoding method performs identically to ML decoding for all these five codes. These STBCs along with the proposed decoding algorithm outperform all known codes in terms of decoding complexity and error performance for $N_t \leq 4$ transmit antennas. We further provide a lower bound on the complexity of full-diversity ACZF/ACZF-SIC decoding. All the five codes listed above achieve this lower bound and hence are optimal in terms of minimizing the ACZF/ACZF-SIC decoding complexity. Both ACZF and ACZF-SIC decoders are amenable to sphere decoding implementation.

I. Introduction 

The problem of constructing Space-Time Block Codes 
(STBCs) that provide good error performance with low com- 
plexity decoding has drawn much attention in the literature. 
Low complexity decoding techniques such as zero-forcing 
(ZF), Minimum Mean Squared Error (MMSE) [1] and Partial
Interference Cancellation [2] are capable of achieving full
diversity, but their error performance is considerably inferior
to that of maximum-likelihood (ML) decoding. Consequently
much work has been directed towards designing full-diversity
codes that admit low complexity decoding with ML performance [3]-[16].

We now briefly review the best known full-diversity, low ML decoding complexity, high-rate STBCs for $N_t \leq 4$ transmit antennas. The Silver code [10], [11] has the least known ML decoding complexity of $M^2$ (where $M$ is the size of the complex constellation used) among known full-rate full-diversity codes for 2 transmit antennas. This is followed by the Golden code [17], [18], which has a higher ML decoding complexity of $M^{2.5}$ [12], [13] but a larger coding gain and better error performance than the Silver code. For 3 antenna systems the full-rate, full-diversity code with the least ML decoding complexity is the Threaded Algebraic Space-Time (TAST) code [19] with a complexity of $M^7$ [14], whereas the code with the best known coding gain is the $3 \times 3$ Perfect code [20], which has a complexity of $M^9$. For 4 antenna systems, the $4 \times 4$ Perfect code has the best coding gain and an ML decoding complexity of $M^{13.5}$ [21]. Among the rate 2 codes for asymmetric MIMO systems with 4 transmit and 2 receive antennas the Srinath-Rajan code [12] has the least ML decoding complexity, $M^{4.5}$, and the best error performance.

It is possible to reduce the decoding complexity further by using a non-ML decoder without trading off error performance, unlike ZF, MMSE or Partial Interference Cancellation receivers, where the reduction in decoding complexity is achieved at the cost of a higher probability of error. Such a decoding technique was proposed by Sirianunpiboon, Howard & Calderbank in [15], [16] for the Golden code and the 3 antenna Perfect code. Though no theoretical analysis of the achievable diversity or coding gain was provided, the simulations showed that these decoders have essentially the same performance as an ML decoder for the Golden code and the 3 antenna Perfect code, but with complexity less than that of ML decoding.

The contributions and organization of this paper are as 
follows. 

• Inspired by [15], [16], we introduce two new low complexity decoding algorithms for STBCs: the Adaptive Conditional Zero-Forcing (ACZF) decoder and the ACZF decoder with successive interference cancellation (ACZF-SIC) (Section II-B). We show that these decoders are capable of achieving full diversity in wireless Rayleigh faded channels and give sufficient conditions for an STBC to give full diversity with ACZF/ACZF-SIC decoding (Section III). The proposed decoders include as a special case the decoding technique of [15], [16] for the Golden code and the 3 antenna Perfect code.

• We show that the best known codes for 2, 3, 4 antennas (the Perfect codes for 2, 3, 4 antennas, the 3 antenna TAST code and the Srinath-Rajan code) are all full-diversity ACZF/ACZF-SIC decodable with complexity strictly less than their ML decoding complexity (Section IV). See



TABLE I
Delay-optimal codes with low complexity, full-diversity decoding algorithms

                                                          Full-diversity codes with least
                                                          known ML decoding complexity
Code           N_t  Rate R  ML decoding   ACZF/ACZF-SIC   ML decoding   Code
                            complexity    decoding        complexity
                                          complexity
-------------  ---  ------  ------------  --------------  ------------  -----------------
Golden         2    2       M^2.5         M^2             M^2           Silver [10], [11]
Perfect        3    3       M^9           M^7             M^7           TAST
TAST           3    3       M^7           M^6             M^7           TAST
Perfect        4    4       M^13.5        M^12            M^13.5        Perfect
Srinath-Rajan  4    2       M^4.5         M^4             M^4.5         Srinath-Rajan

M is the size of the underlying complex constellation.



Table I for a comparison of the ACZF/ACZF-SIC and ML decoding complexities of these codes.

• Simulation results (Section VI) show that the proposed decoder performs identically to the ML decoder for all these five codes, i.e., the reduction in decoding complexity is achieved without trading off error performance. Thus these STBCs along with the proposed decoding algorithm outperform all known codes in terms of decoding complexity and error performance for $N_t \leq 4$. In particular, the Golden code outperforms the Silver code in both decoding complexity and error performance. See Table I for a comparison of the complexities of known full-diversity low decoding complexity codes.

• We derive a lower bound on the complexity of full-diversity ACZF/ACZF-SIC decoding (Section III-B). All the five codes mentioned above achieve this lower bound and hence are optimal in terms of minimizing the ACZF/ACZF-SIC decoding complexity.

• Both ACZF and ACZF-SIC algorithms are amenable to sphere decoding [22] implementation. We show that the ACZF-SIC decoder can be implemented with only a few minor modifications to the original sphere decoding algorithm (Section V).



The channel model is discussed in Section II-A and the
paper is concluded in Section VII.



Notation: Matrices (vectors) are denoted by bold, uppercase (lowercase) letters. The Hermitian, transpose and Frobenius norm of a matrix $\mathbf{X}$ are denoted by $\mathbf{X}^H$, $\mathbf{X}^T$ and $\|\mathbf{X}\|$ respectively. The determinant of a square matrix $\mathbf{X}$ is denoted by $\det(\mathbf{X})$. For any vector $\mathbf{u}$ the diagonal matrix with the elements of $\mathbf{u}$ on the main diagonal is denoted by $\mathrm{diag}(\mathbf{u})$. Unless used as a subscript or to denote indices, $j$ represents $\sqrt{-1}$. For any set $\mathcal{I}$, its complement in the corresponding universal set is denoted by $\mathcal{I}^c$. The expectation operator is denoted by $E(\cdot)$ and the probability of an event $\mathcal{E}$ is denoted by $P(\mathcal{E})$. For any vector $\mathbf{u}$, its $\ell$-th component is denoted by $u(\ell)$. The nearest integer operator is denoted by $\mathrm{rnd}(\cdot)$. The notation $\mathbf{0}$ represents the all zero matrix of the appropriate dimension. For any matrix $\mathbf{A}$ let $\mathrm{vec}(\mathbf{A})$ denote the vectorization of $\mathbf{A}$, i.e., the vector obtained by stacking the columns of $\mathbf{A}$ one below another.



II. Adaptive Conditional Zero-Forcing Decoder 

We first explain the MIMO channel model used in this paper 
and then introduce the ACZF/ACZF-SIC decoding of STBCs. 

A. Channel Model 

We consider a quasi-static Rayleigh flat-fading channel

$$\mathbf{Y} = \sqrt{\mathrm{SNR}}\,\mathbf{X}\mathbf{H} + \mathbf{N} \qquad (1)$$

with $N_t$ transmit antennas, $N_r$ receive antennas and delay $T$. The transmit matrix $\mathbf{X}$ takes values from a Space-Time Block Code (STBC) $\mathcal{C}$ which is a finite subset of $\mathbb{C}^{T \times N_t}$. The $N_t \times N_r$ channel matrix $\mathbf{H}$ is known at the receiver but not at the transmitter. The entries of $\mathbf{H}$ and the noise matrix $\mathbf{N}$ are assumed to be independent and identically distributed, zero mean circularly symmetric complex Gaussian random variables with unit variance. The transmit matrix $\mathbf{X}$ satisfies $E(\|\mathbf{X}\|^2) = T$, so that the average signal to noise ratio at each receive antenna is equal to SNR. It is assumed that the STBC $\mathcal{C}$ is obtained via a design [23] $\mathbf{S} = \sum_{i=1}^{K} s_i \mathbf{A}_i$, where $\mathbf{A}_i \in \mathbb{C}^{T \times N_t}$, $i = 1, \dots, K$, are the linear dispersion or weight matrices, and $s_1, \dots, s_K$ are complex symbols that assume values from a finite constellation $\mathcal{A} \subset \mathbb{C}$, i.e.,

$$\mathcal{C} = \left\{ \sum_{i=1}^{K} s_i \mathbf{A}_i \;\middle|\; s_i \in \mathcal{A},\ i = 1, \dots, K \right\}.$$

The signal set $\mathcal{A}$ is usually (but not always) a QAM, HEX or PSK constellation. The rate of $\mathcal{C}$ is $R = \frac{K}{T}$ in terms of complex symbols per channel use, and $\frac{K}{T} \log_2 |\mathcal{A}|$ in terms of bits per channel use.

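Vectorizing the receive matrix $\mathbf{Y}$ in (1) we obtain

$$\mathbf{y} = \sqrt{\mathrm{SNR}}\,\mathbf{G}\mathbf{s} + \mathbf{n},$$

where $\mathbf{y}$ and $\mathbf{n}$ are vectorizations of $\mathbf{Y}$ and $\mathbf{N}$ respectively, $\mathbf{s} = [s_1\ s_2\ \cdots\ s_K]^T$, and the equivalent channel matrix $\mathbf{G} \in \mathbb{C}^{N_r T \times K}$ is a function of the channel $\mathbf{H}$ and the design $\mathbf{S}$ and is given by $[\mathrm{vec}(\mathbf{A}_1\mathbf{H})\ \mathrm{vec}(\mathbf{A}_2\mathbf{H})\ \cdots\ \mathrm{vec}(\mathbf{A}_K\mathbf{H})]$.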
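The vectorized model can be checked numerically. The following is a minimal sketch, assuming random weight matrices and a random channel purely for illustration (the helper name `equivalent_channel` is hypothetical and not from the paper); it builds $\mathbf{G}$ column by column and confirms that $\mathrm{vec}(\mathbf{X}\mathbf{H}) = \mathbf{G}\mathbf{s}$.

```python
import numpy as np

def equivalent_channel(weights, H):
    """Return G = [vec(A_1 H) ... vec(A_K H)] for T x Nt weight matrices."""
    # vec(.) stacks columns, which corresponds to Fortran ("F") order in NumPy.
    return np.column_stack([(A @ H).flatten(order="F") for A in weights])

rng = np.random.default_rng(0)
T, Nt, Nr, K = 2, 2, 2, 4

# Illustrative random weight matrices, channel and symbols (not a real STBC).
weights = [rng.standard_normal((T, Nt)) + 1j * rng.standard_normal((T, Nt))
           for _ in range(K)]
H = rng.standard_normal((Nt, Nr)) + 1j * rng.standard_normal((Nt, Nr))
s = rng.standard_normal(K) + 1j * rng.standard_normal(K)

X = sum(s_i * A_i for s_i, A_i in zip(s, weights))  # codeword sum_i s_i A_i
G = equivalent_channel(weights, H)

lhs = (X @ H).flatten(order="F")  # vec(XH), the noiseless part of vec(Y)
rhs = G @ s
print(G.shape, np.allclose(lhs, rhs))
```

The scaling by $\sqrt{\mathrm{SNR}}$ and the noise term are omitted here because they do not affect the linear identity being checked.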

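Example G.1: The Golden Code [20] is a rate $R = 2$ code for $N_t = 2$ antennas. It encodes $K = 4$ symbols from a QAM alphabet over $T = 2$ time slots. The design is

$$\mathbf{S} = \begin{bmatrix} \alpha(s_1 + \tau s_3) & j\bar{\alpha}(s_2 + \mu s_4) \\ \alpha(s_2 + \tau s_4) & \bar{\alpha}(s_1 + \mu s_3) \end{bmatrix},$$

where $\tau = \frac{1+\sqrt{5}}{2}$, $\mu = \frac{1-\sqrt{5}}{2}$, $\alpha = 1 + j\mu$ and $\bar{\alpha} = 1 + j\tau$. The four weight matrices are

$$\mathbf{A}_1 = \begin{bmatrix} \alpha & 0 \\ 0 & \bar{\alpha} \end{bmatrix},\quad \mathbf{A}_2 = \begin{bmatrix} 0 & j\bar{\alpha} \\ \alpha & 0 \end{bmatrix},\quad \mathbf{A}_3 = \begin{bmatrix} \alpha\tau & 0 \\ 0 & \bar{\alpha}\mu \end{bmatrix}\ \text{and}\ \mathbf{A}_4 = \begin{bmatrix} 0 & j\bar{\alpha}\mu \\ \alpha\tau & 0 \end{bmatrix}.$$

Consider $N_r = 2$ and let the channel matrix be $\mathbf{H} = \begin{bmatrix} h_1 & h_2 \\ h_3 & h_4 \end{bmatrix}$. Then the equivalent channel matrix $\mathbf{G} = [\mathrm{vec}(\mathbf{A}_1\mathbf{H})\ \mathrm{vec}(\mathbf{A}_2\mathbf{H})\ \cdots\ \mathrm{vec}(\mathbf{A}_4\mathbf{H})]$ is given by

$$\mathbf{G} = \begin{bmatrix} \alpha h_1 & j\bar{\alpha} h_3 & \alpha\tau h_1 & j\bar{\alpha}\mu h_3 \\ \bar{\alpha} h_3 & \alpha h_1 & \bar{\alpha}\mu h_3 & \alpha\tau h_1 \\ \alpha h_2 & j\bar{\alpha} h_4 & \alpha\tau h_2 & j\bar{\alpha}\mu h_4 \\ \bar{\alpha} h_4 & \alpha h_2 & \bar{\alpha}\mu h_4 & \alpha\tau h_2 \end{bmatrix}. \qquad (2)$$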
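As a sanity check on Example G.1, the sketch below builds the equivalent channel matrix from four Golden code weight matrices. The weight matrices used here are an assumption: they are read off from the columns of (2) with $\tau = (1+\sqrt{5})/2$, $\mu = (1-\sqrt{5})/2$, $\alpha = 1 + j\mu$, $\bar{\alpha} = 1 + j\tau$, and any normalization constant is ignored. The channel values are arbitrary.

```python
import numpy as np

tau = (1 + np.sqrt(5)) / 2
mu = (1 - np.sqrt(5)) / 2
alpha = 1 + 1j * mu
alpha_bar = 1 + 1j * tau

# Weight matrices inferred from the columns of (2); an assumption of this sketch.
A1 = np.array([[alpha, 0], [0, alpha_bar]])
A2 = np.array([[0, 1j * alpha_bar], [alpha, 0]])
A3 = np.array([[alpha * tau, 0], [0, alpha_bar * mu]])
A4 = np.array([[0, 1j * alpha_bar * mu], [alpha * tau, 0]])
golden_weights = [A1, A2, A3, A4]

def golden_equivalent_channel(H):
    """G = [vec(A_1 H) ... vec(A_4 H)], with vec stacking columns."""
    return np.column_stack([(A @ H).flatten(order="F") for A in golden_weights])

# Arbitrary 2 x 2 channel H = [[h1, h2], [h3, h4]].
H = np.array([[1 + 2j, 3 - 1j], [0.5j, -2 + 1j]])
h1, h2, h3, h4 = H[0, 0], H[0, 1], H[1, 0], H[1, 1]
G = golden_equivalent_channel(H)

# First and second rows of (2), written out entry by entry.
row1 = np.array([alpha * h1, 1j * alpha_bar * h3,
                 alpha * tau * h1, 1j * alpha_bar * mu * h3])
row2 = np.array([alpha_bar * h3, alpha * h1,
                 alpha_bar * mu * h3, alpha * tau * h1])
print(np.allclose(G[0], row1), np.allclose(G[1], row2))
```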



B. Adaptive Conditional Zero-Forcing Decoder 

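We will introduce some notation before explaining the ACZF decoder. Let $\mathcal{I}_1, \dots, \mathcal{I}_L$ be any $L$ subsets of $\{1, \dots, K\}$, each of cardinality $\lambda$. The subsets $\mathcal{I}_1, \dots, \mathcal{I}_L$ need not be a partition of $\{1, \dots, K\}$, and they may also have non-trivial intersections. For any $\mathcal{I} \subseteq \{1, \dots, K\}$ denote by $\mathbf{s}_{\mathcal{I}}$ the vector comprising those symbols $s_i$ whose indices belong to $\mathcal{I}$, i.e., if $\mathcal{I} = \{i_1, i_2, \dots, i_{|\mathcal{I}|}\}$ with $i_1 < i_2 < \cdots < i_{|\mathcal{I}|}$ then $\mathbf{s}_{\mathcal{I}} = [s_{i_1}\ s_{i_2}\ \cdots\ s_{i_{|\mathcal{I}|}}]^T$. Similarly let $\mathbf{G}_{\mathcal{I}}$ be the submatrix of $\mathbf{G}$ comprising those columns whose indices belong to $\mathcal{I}$, i.e., $[\mathrm{vec}(\mathbf{A}_{i_1}\mathbf{H})\ \mathrm{vec}(\mathbf{A}_{i_2}\mathbf{H})\ \cdots\ \mathrm{vec}(\mathbf{A}_{i_{|\mathcal{I}|}}\mathbf{H})]$. Further, let the operator $(\cdot)^{\dagger}$ denote the left pseudo-inverse of a matrix, and $u(j)$ denote the $j$-th entry of a vector $\mathbf{u}$.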

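The ACZF decoder, given in Algorithm 1, functions as follows. Given $\mathbf{y}$ and $\mathbf{G}$, from among $\mathcal{I}_1, \dots, \mathcal{I}_L$ the subset $\mathcal{I}_m$ with the maximum value of $\det(\mathbf{G}_{\mathcal{I}_m}^H \mathbf{G}_{\mathcal{I}_m})$ is chosen. For each of the $|\mathcal{A}|^{K-\lambda}$ possible values of the vector $\mathbf{s}_{\mathcal{I}_m^c}$, the interference in $\mathbf{y}$ from $\mathbf{s}_{\mathcal{I}_m^c}$ is first removed and the remaining symbols $\mathbf{s}_{\mathcal{I}_m}$ for this instantiation of $\mathbf{s}_{\mathcal{I}_m^c}$ are decoded by zero-forcing, resulting in the decoded vector $\hat{\mathbf{s}}_{\mathcal{I}_m}(\mathbf{s}_{\mathcal{I}_m^c})$. At the end of this step we have a set of $|\mathcal{A}|^{K-\lambda}$ tuples

$$\left\{ \left(\mathbf{s}_{\mathcal{I}_m^c},\ \hat{\mathbf{s}}_{\mathcal{I}_m}(\mathbf{s}_{\mathcal{I}_m^c})\right) \;\middle|\; \mathbf{s}_{\mathcal{I}_m^c} \in \mathcal{A}^{K-\lambda} \right\}. \qquad (3)$$

From among these vectors the decoder chooses the tuple $\hat{\mathbf{s}}$ that minimizes $\|\mathbf{y} - \sqrt{\mathrm{SNR}}\,\mathbf{G}\hat{\mathbf{s}}\|^2$.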

After the interference from sx= is removed from y the 
symbols in sx m are decoded by a ZF receiver. The post- 
processing signal to noise ratio for decoding sx m is cap- 
tured by det(Gj m Gx m ) fl~3, CD. To optimize the error 
performance the decoder therefore chooses the subset whose 
equivalent channel matrix has the largest determinant. 

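Example G.2: Continuing Example G.1, consider the ACZF decoder for the Golden code with parameters $L = 2$, $\mathcal{I}_1 = \{1, 2\}$ and $\mathcal{I}_2 = \{3, 4\}$. In this case $\lambda = 2$, $\mathbf{s}_{\mathcal{I}_1} = [s_1\ s_2]^T$, $\mathbf{s}_{\mathcal{I}_2} = [s_3\ s_4]^T$, $\mathbf{G}_{\mathcal{I}_1}$ comprises the first two columns of $\mathbf{G}$ in (2) and $\mathbf{G}_{\mathcal{I}_2}$ comprises the last two columns. The decoder first determines $m = \arg\max_{\ell \in \{1, 2\}} \det(\mathbf{G}_{\mathcal{I}_\ell}^H \mathbf{G}_{\mathcal{I}_\ell})$. Note that in this case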


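Input: Received signal $\mathbf{y}$, equivalent channel matrix $\mathbf{G}$, subsets $\mathcal{I}_1, \dots, \mathcal{I}_L$
Output: Decoded symbol vector $\hat{\mathbf{s}}$
Set $m := 0$, maxdet $:= 0$;
foreach $\ell = 1, \dots, L$ do
    if maxdet $< \det(\mathbf{G}_{\mathcal{I}_\ell}^H \mathbf{G}_{\mathcal{I}_\ell})$ then
        maxdet $:= \det(\mathbf{G}_{\mathcal{I}_\ell}^H \mathbf{G}_{\mathcal{I}_\ell})$;
        $m := \ell$;
    end
end
Set $\mathbf{u}$ as the all zero vector of length $\lambda$;
foreach $\mathbf{a}_{\mathcal{I}_m^c} \in \mathcal{A}^{K-\lambda}$ do
    Calculate $\mathbf{v} := \frac{1}{\sqrt{\mathrm{SNR}}}\,\mathbf{G}_{\mathcal{I}_m}^{\dagger}\left(\mathbf{y} - \sqrt{\mathrm{SNR}}\,\mathbf{G}_{\mathcal{I}_m^c}\mathbf{a}_{\mathcal{I}_m^c}\right)$;
    foreach $j = 1, \dots, \lambda$ do
        Find $u(j) := \arg\min_{a \in \mathcal{A}} |v(j) - a|$;
    end
    Set $\hat{\mathbf{s}}_{\mathcal{I}_m}(\mathbf{a}_{\mathcal{I}_m^c}) := \mathbf{u}$;
end
Set mindis $:= \infty$ and $\hat{\mathbf{s}}_{\mathcal{I}_m^c} := \mathbf{0}$;
foreach $\mathbf{a}_{\mathcal{I}_m^c} \in \mathcal{A}^{K-\lambda}$ do
    if mindis $> \|\mathbf{y} - \sqrt{\mathrm{SNR}}\,(\mathbf{G}_{\mathcal{I}_m^c}\mathbf{a}_{\mathcal{I}_m^c} + \mathbf{G}_{\mathcal{I}_m}\hat{\mathbf{s}}_{\mathcal{I}_m}(\mathbf{a}_{\mathcal{I}_m^c}))\|^2$ then
        mindis $:= \|\mathbf{y} - \sqrt{\mathrm{SNR}}\,(\mathbf{G}_{\mathcal{I}_m^c}\mathbf{a}_{\mathcal{I}_m^c} + \mathbf{G}_{\mathcal{I}_m}\hat{\mathbf{s}}_{\mathcal{I}_m}(\mathbf{a}_{\mathcal{I}_m^c}))\|^2$;
        $\hat{\mathbf{s}}_{\mathcal{I}_m^c} := \mathbf{a}_{\mathcal{I}_m^c}$;
    end
end
Set $\hat{\mathbf{s}}$ as the concatenation of $\hat{\mathbf{s}}_{\mathcal{I}_m^c}$ and $\hat{\mathbf{s}}_{\mathcal{I}_m}(\hat{\mathbf{s}}_{\mathcal{I}_m^c})$;
Algorithm 1: The ACZF Decoder.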



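$\mathcal{I}_m^c = \mathcal{I}_{3-m}$. Conditioned on $\mathbf{s}_{\mathcal{I}_{3-m}}$ the receiver decodes $\hat{\mathbf{s}}_{\mathcal{I}_m}(\mathbf{s}_{\mathcal{I}_{3-m}})$ by zero-forcing. Then from among the $M^2$ tuples $\{(\mathbf{s}_{\mathcal{I}_{3-m}}, \hat{\mathbf{s}}_{\mathcal{I}_m}(\mathbf{s}_{\mathcal{I}_{3-m}}))\}$ the one that minimizes $\|\mathbf{y} - \sqrt{\mathrm{SNR}}\,\mathbf{G}\mathbf{s}\|^2$ is chosen to be the output. This decoder for the Golden Code was first proposed in [16], but no theoretical analysis regarding diversity or minimum achievable complexity was provided. ■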
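The steps of the ACZF decoder can be sketched in NumPy as follows. This is an illustrative sketch rather than the authors' implementation: the function name `aczf_decode`, the 0-based subset indices, the brute-force slicer and the random test channel are all assumptions made for the example, and the two loops of the decoder are merged into one pass.

```python
import itertools
import numpy as np

def aczf_decode(y, G, subsets, constellation, snr=1.0):
    """ACZF sketch for y = sqrt(snr) * G @ s + n; subsets hold 0-based indices."""
    K = G.shape[1]
    root = np.sqrt(snr)

    # Step 1: choose the subset I_m maximizing det(G_I^H G_I).
    def gram_det(I):
        G_I = G[:, I]
        return np.linalg.det(G_I.conj().T @ G_I).real

    I_m = list(max(subsets, key=gram_det))
    I_c = [k for k in range(K) if k not in I_m]
    G_m, G_c = G[:, I_m], G[:, I_c]
    G_m_pinv = np.linalg.pinv(G_m)  # left pseudo-inverse when G_m has full column rank

    best, best_dist = None, np.inf
    # Step 2: condition on every value of the symbols outside I_m.
    for a_c in itertools.product(constellation, repeat=len(I_c)):
        a_c = np.array(a_c)
        v = G_m_pinv @ (y - root * (G_c @ a_c)) / root
        # Slice each zero-forced entry to the nearest constellation point.
        u = np.array([constellation[np.argmin(np.abs(constellation - v_j))]
                      for v_j in v])
        # Step 3: keep the candidate with the smallest Euclidean distance.
        dist = np.linalg.norm(y - root * (G_c @ a_c + G_m @ u)) ** 2
        if dist < best_dist:
            best_dist = dist
            best = np.empty(K, dtype=complex)
            best[I_c], best[I_m] = a_c, u
    return best

# Noiseless demonstration with a random 4 x 4 channel and 4-QAM symbols.
rng = np.random.default_rng(1)
qam4 = np.array([1 + 1j, 1 - 1j, -1 + 1j, -1 - 1j])
G = rng.standard_normal((4, 4)) + 1j * rng.standard_normal((4, 4))
s = rng.choice(qam4, size=4)
y = np.sqrt(10.0) * G @ s
s_hat = aczf_decode(y, G, [[0, 1], [2, 3]], qam4, snr=10.0)
print(np.allclose(s_hat, s))
```

The brute-force slicer costs $M$ distance computations per symbol; for regular QAM it could be replaced by a constant-time hard limiter.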
It is well known [26] that at high values of SNR the ZF receiver aided with successive interference cancellation (ZF-SIC) performs better than mere ZF decoding. The ACZF decoder can be integrated with SIC (ACZF-SIC) as well. The ACZF-SIC decoder detects the symbols in $\mathbf{s}_{\mathcal{I}_m}$ one by one, and removes the effect of the already detected symbols from the received vector by interference cancellation.

Complexity Analysis: Consider the step $u(j) = \arg\min_{a \in \mathcal{A}} |v(j) - a|$ in Algorithm 1. If the size of the complex constellation $\mathcal{A}$ is $M$, this step requires $M$ computations of $|v(j) - a|$. However, if $\mathcal{A}$ is a regular $M$-ary QAM constellation then this step can be implemented with constant complexity, independent of $M$, by hard limiting [8], [12], [13] as shown in (4) and (5), where $(\cdot)_{Re}$ and $(\cdot)_{Im}$ denote the real and imaginary parts respectively.

$$(u(j))_{Re} = \min\left\{ \max\left\{ \mathrm{rnd}\left( (v(j))_{Re} + \frac{\sqrt{M}-1}{2} \right),\ 0 \right\},\ \sqrt{M}-1 \right\} - \frac{\sqrt{M}-1}{2} \qquad (4)$$

$$(u(j))_{Im} = \min\left\{ \max\left\{ \mathrm{rnd}\left( (v(j))_{Im} + \frac{\sqrt{M}-1}{2} \right),\ 0 \right\},\ \sqrt{M}-1 \right\} - \frac{\sqrt{M}-1}{2} \qquad (5)$$

This step is performed $\lambda M^{K-\lambda}$ times throughout the decoding process. Thus the complexity order of the ACZF decoder is $M^{K-\lambda}$ and $M^{K-\lambda+1}$ for QAM and general constellations respectively. Identical results hold true for the complexity of ACZF-SIC decoding also. For example, for the ACZF decoder for the Golden Code in Example G.2, $K = 4$, $\lambda = 2$ and $\mathcal{A}$ is a regular QAM constellation. Hence this decoder has a complexity of $M^{K-\lambda} = M^2$. On the other hand the least known ML decoding complexity of the Golden Code is $M^{2.5}$ [12], [13].
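The constant-complexity slicing step can be illustrated as follows. This is a sketch under an assumed normalization: the square $M$-QAM constellation is taken to have unit-spaced PAM levels $k - (\sqrt{M}-1)/2$, $k = 0, \dots, \sqrt{M}-1$, on each axis, and the function names are hypothetical. The hard limiter is compared with an exhaustive nearest-point search.

```python
import numpy as np

def slice_pam(x, m):
    """Nearest point of the m-ary PAM {k - (m-1)/2 : k = 0..m-1} by hard limiting."""
    shift = (m - 1) / 2
    k = np.floor(x + shift + 0.5)           # round to the nearest integer index
    return min(max(k, 0), m - 1) - shift    # clip to the valid range, shift back

def slice_qam(v, M):
    """Constant-time slicer: real and imaginary parts are limited independently."""
    m = int(round(np.sqrt(M)))
    return complex(slice_pam(v.real, m), slice_pam(v.imag, m))

def slice_brute(v, M):
    """Reference slicer: exhaustive search over all M constellation points."""
    m = int(round(np.sqrt(M)))
    levels = np.arange(m) - (m - 1) / 2
    points = np.array([a + 1j * b for a in levels for b in levels])
    return complex(points[np.argmin(np.abs(points - v))])

rng = np.random.default_rng(0)
samples = 3 * (rng.standard_normal(300) + 1j * rng.standard_normal(300))
all_match = all(np.isclose(slice_qam(v, M), slice_brute(v, M))
                for M in (4, 16, 64) for v in samples)
print(all_match)
```

The hard limiter uses a fixed number of operations per symbol, whereas the exhaustive search grows linearly with $M$, which is the source of the extra factor of $M$ for general constellations.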

III. Full-Diversity Criterion 

In this section we give two equivalent full-diversity criteria
for ACZF/ACZF-SIC decoders. We then give a lower bound on
the complexity of full-diversity ACZF/ACZF-SIC decoding.

A. Full-diversity Criterion 

The vector $\mathbf{v}$ in Algorithm 1 is obtained by multiplying another vector by the left pseudo-inverse of $\mathbf{G}_{\mathcal{I}_m}$. Thus it is implicitly assumed that $\mathbf{G}_{\mathcal{I}_m}$ has full column rank. Note that this matrix is the choice from among $\mathbf{G}_{\mathcal{I}_1}, \dots, \mathbf{G}_{\mathcal{I}_L}$ with the largest value of $\det(\mathbf{G}_{\mathcal{I}_\ell}^H \mathbf{G}_{\mathcal{I}_\ell})$. The ACZF decoding algorithm thus assumes that for every channel realization $\mathbf{H} \neq \mathbf{0}$ at least one of the $L$ matrices $\mathbf{G}_{\mathcal{I}_1}, \dots, \mathbf{G}_{\mathcal{I}_L}$ has full column rank. The same is true in the case of ACZF-SIC decoding also. In the following theorem we show that this condition, which is necessary for the implementation of the ACZF/ACZF-SIC decoder, is also a sufficient condition for ACZF to achieve the same diversity as the ML decoder.

Theorem 1: The ACZF decoder achieves the same diversity order as the ML decoder if for every channel realization $\mathbf{H} \neq \mathbf{0}$ at least one of the $L$ matrices $\mathbf{G}_{\mathcal{I}_1}, \dots, \mathbf{G}_{\mathcal{I}_L}$ has full column rank.

Proof: Proof is given in Appendix A. ■

Both zero-forcing and conditional zero-forcing decoders [24] are special cases of ACZF decoding. In both these cases $L = 1$ and hence the decoder is not 'adaptive' any more, i.e., for every channel $\mathbf{H}$ we have $m = 1$. When the number of subsets is $L = 1$ and $\mathcal{I}_1 = \{1, \dots, K\}$ the ACZF decoder reduces to the zero-forcing receiver. The criterion of Theorem 1 in this case reduces to the full-diversity criterion for ZF decoding given in [1], [2]. If $L = 1$ and $\mathcal{I}_1 \subsetneq \{1, \dots, K\}$ then the ACZF decoder reduces to the conditional ZF decoder [24]. In this case Theorem 1 requires that $\mathbf{G}_{\mathcal{I}_1}$ be of full column rank for every $\mathbf{H} \neq \mathbf{0}$ and that the difference of any two codewords in $\mathcal{C}$ be of rank $N_t$ for the conditional ZF decoder to achieve a diversity order of $N_t N_r$. This coincides with the full-diversity criterion for conditional ZF decoders given in [24].

Theorem 1 imposes a criterion on the equivalent channel matrix $\mathbf{G}$, which is a function of the linear dispersion matrices and the number of receive antennas. In the following theorem we give an equivalent full-diversity criterion in terms of the linear dispersion matrices alone. This criterion shows that the sufficient condition in Theorem 1 is independent of the number of receive antennas, i.e., either the STBC $\mathcal{C}$ satisfies the criterion for all $N_r \geq 1$ or it does not satisfy the criterion for any $N_r$. We now introduce some notation towards stating the equivalent criterion. For any $\mathcal{I} = \{i_1, \dots, i_{|\mathcal{I}|}\}$ and any vector $\mathbf{u} = [u_1\ u_2\ \cdots\ u_{|\mathcal{I}|}]^T \in \mathbb{C}^{|\mathcal{I}|}$ let $\mathbf{X}_{\mathcal{I}}(\mathbf{u}) = \sum_{j=1}^{|\mathcal{I}|} u_j \mathbf{A}_{i_j}$, i.e., $\mathbf{X}_{\mathcal{I}}(\mathbf{u})$ is the complex linear combination of the weight matrices with indices in $\mathcal{I}$, with the elements of the vector $\mathbf{u}$ defining the corresponding complex coefficients.

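Theorem 2: The ACZF decoder achieves the same diversity order as the ML decoder if for every choice of $\mathbf{u}_1 \in \mathbb{C}^{\lambda} \setminus \{\mathbf{0}\}$, $\mathbf{u}_2 \in \mathbb{C}^{\lambda} \setminus \{\mathbf{0}\}$, ..., $\mathbf{u}_L \in \mathbb{C}^{\lambda} \setminus \{\mathbf{0}\}$, the $LT \times N_t$ matrix

$$\mathbf{X}(\mathbf{u}_1, \dots, \mathbf{u}_L) = \begin{bmatrix} \mathbf{X}_{\mathcal{I}_1}(\mathbf{u}_1) \\ \mathbf{X}_{\mathcal{I}_2}(\mathbf{u}_2) \\ \vdots \\ \mathbf{X}_{\mathcal{I}_L}(\mathbf{u}_L) \end{bmatrix} \qquad (6)$$

has full column rank.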

Proof: Proof is given in Appendix [B] ■ 
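The criterion of Theorem 2 can be probed numerically by stacking the matrices $\mathbf{X}_{\mathcal{I}_\ell}(\mathbf{u}_\ell)$ and inspecting the smallest singular value. The sketch below is only a sanity check, not a proof: random sampling cannot certify full column rank for every choice of the vectors. The toy weight matrices and the helper names are assumptions made for illustration.

```python
import numpy as np

def stacked_matrix(weights, subsets, us):
    """Stack X_I(u) = sum_j u_j A_{i_j} over the L subsets (0-based indices)."""
    blocks = [sum(u_j * weights[i] for u_j, i in zip(u, I))
              for I, u in zip(subsets, us)]
    return np.vstack(blocks)

def min_sigma_over_random_u(weights, subsets, trials=200, seed=0):
    """Smallest singular value of the stacked matrix seen over random unit-norm u."""
    rng = np.random.default_rng(seed)
    worst = np.inf
    for _ in range(trials):
        us = []
        for I in subsets:
            u = rng.standard_normal(len(I)) + 1j * rng.standard_normal(len(I))
            us.append(u / np.linalg.norm(u))
        sigma = np.linalg.svd(stacked_matrix(weights, subsets, us), compute_uv=False)
        worst = min(worst, sigma[-1])
    return worst

# Toy 2 x 2 weight matrices (illustrative only, not one of the paper's codes).
toy_weights = [np.eye(2), np.array([[0.0, 1.0], [1.0, 0.0]])]

# One subset {1, 2}: u = (1, 1) gives the rank-1 all-ones matrix, so the criterion fails.
bad = stacked_matrix(toy_weights, [[0, 1]], [np.array([1.0, 1.0])])

# Two singleton subsets: the stacked 4 x 2 matrix always has full column rank.
worst = min_sigma_over_random_u(toy_weights, [[0], [1]])
print(np.linalg.matrix_rank(bad), worst)
```

In the second configuration the Gram matrix of the stacked matrix equals $2\mathbf{I}$ for unit-modulus coefficients, so the smallest singular value is $\sqrt{2}$ for every sample.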

The criteria of Theorems 1 and 2 ensure full diversity with ACZF-SIC decoders also.

Lemma 1: The ACZF-SIC decoder achieves the same diversity order as the ML decoder if the STBC satisfies the criteria of Theorem 1 or 2.

Proof: Proof is given in Appendix C. ■

The sufficient condition of Theorem 2 is independent of the number of receive antennas $N_r$ and the choice of the constellation $\mathcal{A}$, and depends only on the weight matrices $\mathbf{A}_1, \dots, \mathbf{A}_K$. The signal set $\mathcal{A}$ can be chosen such that the STBC gives full diversity with ML decoding.

Corollary 1: If the difference of any two codewords of $\mathcal{C}$ is of rank $N_t$, and if $\mathcal{C}$ satisfies the criterion of Theorem 1 or 2, then $\mathcal{C}$ achieves a diversity of $N_t N_r$ with ACZF and ACZF-SIC decoding.

B. Lower Bound on Decoding Complexity 

Consider any code for $N_t$ antennas with delay $T$ that satisfies the criteria of Theorems 1 and 2 for a certain number of receive antennas $N_r$. Then the code satisfies the criterion of Theorem 1 for $N_r = 1$ as well. For such a code with $N_r = 1$, for every channel $\mathbf{H} \neq \mathbf{0}$ the $T \times \lambda$ matrix $\mathbf{G}_{\mathcal{I}_m}$ has full column rank. This implies that $\lambda \leq T$ and that the order of decoding complexity is at least $M^{K-T}$ and $M^{K-T+1}$ for QAM and arbitrary signal sets respectively. If we restrict our attention to minimum-delay codes that provide full diversity we have $T = N_t$, and the lower bound on decoding complexity is $M^{K-N_t}$ and $M^{K-N_t+1}$ for QAM and arbitrary constellations respectively.
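The bound can be tabulated for the five codes considered in this paper. The snippet below is a small bookkeeping sketch; the parameters $K$, $T$, $\lambda$ and the constellation type of each code are taken from the text, while the function and dictionary names are hypothetical.

```python
def aczf_exponent(K, lam, qam=True):
    """Exponent e such that ACZF/ACZF-SIC decoding complexity is of order M**e."""
    return K - lam if qam else K - lam + 1

def lower_bound_exponent(K, T, qam=True):
    """Exponent of the lower bound, which follows from lambda <= T."""
    return K - T if qam else K - T + 1

# (K, T, lambda, regular QAM?) for each code, as stated in the paper.
codes = {
    "Golden":        dict(K=4,  T=2, lam=2, qam=True),
    "Perfect 3x3":   dict(K=9,  T=3, lam=3, qam=False),  # HEX constellation
    "TAST 3x3":      dict(K=9,  T=3, lam=3, qam=True),
    "Perfect 4x4":   dict(K=16, T=4, lam=4, qam=True),
    "Srinath-Rajan": dict(K=8,  T=4, lam=4, qam=True),
}

exponents = {name: aczf_exponent(p["K"], p["lam"], p["qam"])
             for name, p in codes.items()}
bounds = {name: lower_bound_exponent(p["K"], p["T"], p["qam"])
          for name, p in codes.items()}

for name in codes:
    print(f"{name}: ACZF exponent {exponents[name]}, lower bound {bounds[name]}")
```

Since every code listed uses $\lambda = T$, the ACZF exponent coincides with the lower bound in each case.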

IV. Full-diversity ACZF/ACZF-SIC Decodable 
Codes with Optimal Decoding Complexity 

In this section we show that some of the best codes known for $N_t = 2$, 3 and 4 antennas are full-diversity ACZF/ACZF-SIC decodable. These include the Perfect Codes [20] for 2, 3 and 4 antennas (the two antenna Perfect Code being the Golden Code [17], [18]), the $3 \times 3$ Threaded Algebraic Space-Time Code (TAST) [19] and the Srinath-Rajan Code [12], which is the best known rate 2 code for 4 transmit and 2 receive antenna MIMO systems. All these codes achieve the lower bound on the decoding complexity given in Section III-B. The ACZF decoding algorithm for the 3 antenna Perfect Code given in this section was first proposed in [15], but without the proof of full diversity. The results of this section are summarized in Table I. The table includes the ML and ACZF/ACZF-SIC decoding complexities of these codes, and the details of the full-diversity codes for the same values of $(N_t, R)$ that have the previously least known decoding complexity. In all the cases, the proposed decoders have the least complexity among all known full-diversity decoding methods.

A. The Golden Code 

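In this subsection we show that the ACZF/ACZF-SIC decoder for the Golden Code given in Example G.2 achieves full diversity. From Theorem 2 we need to show that the matrix

$$\mathbf{X} = \begin{bmatrix} \alpha s_1 & j\bar{\alpha} s_2 \\ \alpha s_2 & \bar{\alpha} s_1 \\ \alpha\tau s_3 & j\bar{\alpha}\mu s_4 \\ \alpha\tau s_4 & \bar{\alpha}\mu s_3 \end{bmatrix}$$

has linearly independent columns whenever $[s_1\ s_2]^T, [s_3\ s_4]^T \in \mathbb{C}^2 \setminus \{\mathbf{0}\}$.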

Proof of full-diversity: Proof is by contradiction. Suppose $[s_1\ s_2]^T$ and $[s_3\ s_4]^T$ are non-zero and the columns of $\mathbf{X}$ are linearly dependent. We will first argue that none of $s_1, \dots, s_4$ is equal to zero. If $s_2 = 0$ then $s_1$ cannot be zero, as this would make the vector $[s_1\ s_2]^T = \mathbf{0}$. Thus, if $s_2 = 0$ the upper $2 \times 2$ submatrix of $\mathbf{X}$ is a diagonal full-ranked matrix, and thus $\mathbf{X}$ is of rank 2. This negates our initial assumption, and hence $s_2 \neq 0$. Using a similar argument we have $s_1, s_3, s_4 \neq 0$ as well.

Let $[a\ b]^T$ be a non-zero vector in the nullspace of $\mathbf{X}$. Since neither column of $\mathbf{X}$ is zero we have $a, b \neq 0$. Now consider the first row of $\mathbf{X}$. We have $\alpha s_1 a + j\bar{\alpha} s_2 b = 0$. This implies that $|\alpha s_1 a| = |\bar{\alpha} s_2 b|$, i.e., $\frac{|s_1|}{|s_2|} = \frac{|\bar{\alpha} b|}{|\alpha a|}$. Similarly from the second row we obtain $\frac{|s_1|}{|s_2|} = \frac{|\alpha a|}{|\bar{\alpha} b|}$. Equating the two expressions gives

$$\frac{|a|}{|b|} = \frac{|\bar{\alpha}|}{|\alpha|}. \qquad (7)$$

Using a similar argument with the last two rows of $\mathbf{X}$ we obtain

$$\frac{|a|}{|b|} = \frac{|\bar{\alpha}\mu|}{|\alpha\tau|}. \qquad (8)$$

However (7) and (8) together imply that $|\tau| = |\mu|$, which is not true. Thus, by contradiction, we have shown that $\mathbf{X}$ has linearly independent columns whenever $[s_1\ s_2]^T$ and $[s_3\ s_4]^T$ are non-zero. Since the Golden Code achieves full diversity with the ML decoder, it achieves full diversity with the ACZF/ACZF-SIC decoder as well. ■

Thus the Golden code is full-diversity decodable with complexity $M^2$. Note that this meets the lower bound on the decoding complexity given in Section III-B. Further, this is a reduction by a factor of $M^{0.5}$ from the ML decoding complexity.

B. The 3x3 Perfect Code 

This full-diversity ML decodable STBC encodes $K = 9$ information symbols that assume values from a HEX constellation. The delay is $T = 3$, and the ACZF/ACZF-SIC decoder employs $L = 3$ subsets $\mathcal{I}_1 = \{1, 2, 3\}$, $\mathcal{I}_2 = \{4, 5, 6\}$ and



I 3 = {7, 8, 9}. Let 7 = ei 



U : 
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3 
1 
3 1 



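The weight matrix of the $k$-th symbol in the $\ell$-th group, i.e., the weight matrix of the symbol $s_{3(\ell-1)+k}$, is $\mathbf{A}_{3(\ell-1)+k} = \mathbf{U}^{k-1}\mathbf{D}_\ell$, where $\mathbf{D}_1, \mathbf{D}_2, \mathbf{D}_3$ are diagonal matrices that are specified in [20].

Proof of full-diversity: For any choice of complex vectors $\mathbf{s}_{\mathcal{I}_1}, \mathbf{s}_{\mathcal{I}_2}, \mathbf{s}_{\mathcal{I}_3} \in \mathbb{C}^3 \setminus \{\mathbf{0}\}$ we need to show that the matrix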



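$$\mathbf{X} = \begin{bmatrix} \sum_{k=1}^{3} s_k \mathbf{U}^{k-1}\mathbf{D}_1 \\ \sum_{k=1}^{3} s_{3+k} \mathbf{U}^{k-1}\mathbf{D}_2 \\ \sum_{k=1}^{3} s_{6+k} \mathbf{U}^{k-1}\mathbf{D}_3 \end{bmatrix}$$

is of rank 3. Since $\mathbf{U}$ is unitary it has an orthonormal set of eigenvectors and can be decomposed as $\mathbf{V}\boldsymbol{\Lambda}\mathbf{V}^H$, where the columns $\mathbf{v}_1, \mathbf{v}_2, \mathbf{v}_3$ of $\mathbf{V}$ are the eigenvectors and $\boldsymbol{\Lambda} = \mathrm{diag}(\sigma_1, \sigma_2, \sigma_3)$ is the diagonal matrix comprising the eigenvalues. Therefore, $\mathbf{U}^{k-1} = \mathbf{V}\boldsymbol{\Lambda}^{k-1}\mathbf{V}^H$, $k = 1, 2, 3$, and the ranks of $\mathbf{X}$ and

$$\begin{bmatrix} \sum_{k=1}^{3} s_k \boldsymbol{\Lambda}^{k-1}\mathbf{V}^H\mathbf{D}_1 \\ \sum_{k=1}^{3} s_{3+k} \boldsymbol{\Lambda}^{k-1}\mathbf{V}^H\mathbf{D}_2 \\ \sum_{k=1}^{3} s_{6+k} \boldsymbol{\Lambda}^{k-1}\mathbf{V}^H\mathbf{D}_3 \end{bmatrix} \qquad (9)$$

are the same. For $\ell = 1, 2, 3$ the matrix $\sum_{k=1}^{3} s_{3(\ell-1)+k}\boldsymbol{\Lambda}^{k-1} = \mathrm{diag}(z_{3(\ell-1)+1}, z_{3(\ell-1)+2}, z_{3(\ell-1)+3})$ is a diagonal matrix, where the vectors $[s_{3(\ell-1)+1}\ s_{3(\ell-1)+2}\ s_{3(\ell-1)+3}]^T$ and $[z_{3(\ell-1)+1}\ z_{3(\ell-1)+2}\ z_{3(\ell-1)+3}]^T$ are related as

$$\begin{bmatrix} z_{3(\ell-1)+1} \\ z_{3(\ell-1)+2} \\ z_{3(\ell-1)+3} \end{bmatrix} = \begin{bmatrix} 1 & \sigma_1 & \sigma_1^2 \\ 1 & \sigma_2 & \sigma_2^2 \\ 1 & \sigma_3 & \sigma_3^2 \end{bmatrix} \begin{bmatrix} s_{3(\ell-1)+1} \\ s_{3(\ell-1)+2} \\ s_{3(\ell-1)+3} \end{bmatrix}.$$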
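The three eigenvalues of $\mathbf{U}$ are all distinct and hence the above transformation matrix is Vandermonde and thus is invertible. Since $\mathbf{s}_{\mathcal{I}_\ell}$ is non-zero, the vector $[z_{3(\ell-1)+1}\ z_{3(\ell-1)+2}\ z_{3(\ell-1)+3}]^T$ is also non-zero, and hence at least one of its components is non-zero. For each $\ell = 1, 2, 3$ let $i_\ell \in \mathcal{I}_\ell$ be such that $z_{i_\ell} \neq 0$. The $3 \times 3$ submatrix of (9) comprising the three rows corresponding to $z_{i_1}, z_{i_2}, z_{i_3}$ is

$$\begin{bmatrix} z_{i_1}\mathbf{v}_{i_1}^H\mathbf{D}_1 \\ z_{i_2}\mathbf{v}_{i_2-3}^H\mathbf{D}_2 \\ z_{i_3}\mathbf{v}_{i_3-6}^H\mathbf{D}_3 \end{bmatrix} = \begin{bmatrix} z_{i_1} & 0 & 0 \\ 0 & z_{i_2} & 0 \\ 0 & 0 & z_{i_3} \end{bmatrix} \begin{bmatrix} \mathbf{v}_{i_1}^H\mathbf{D}_1 \\ \mathbf{v}_{i_2-3}^H\mathbf{D}_2 \\ \mathbf{v}_{i_3-6}^H\mathbf{D}_3 \end{bmatrix}$$

and this matrix is full-ranked whenever the rightmost matrix in the factorization above is full-ranked. By direct computation we have verified that for every choice of $i_1 \in \mathcal{I}_1$, $i_2 \in \mathcal{I}_2$ and $i_3 \in \mathcal{I}_3$ this matrix is indeed full-ranked. Thus, for every choice of $\mathbf{s}_{\mathcal{I}_1}, \mathbf{s}_{\mathcal{I}_2}, \mathbf{s}_{\mathcal{I}_3} \in \mathbb{C}^3 \setminus \{\mathbf{0}\}$ there exists a $3 \times 3$ submatrix of $\mathbf{X}$ that is full-ranked and hence the columns of $\mathbf{X}$ are linearly independent. This completes the proof. ■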
For the proposed ACZF/ACZF-SIC decoder for the $3 \times 3$ Perfect Code, $\lambda = 3$. Since the constellation $\mathcal{A}$ is a HEX signal set, the decoder complexity is $M^{K-\lambda+1} = M^7$.

C. The 4 x 4 Perfect Code

This is a full-diversity ML decodable STBC with parameters $N_t = 4$, $T = 4$ and $K = 16$. The complex symbols are encoded using a regular QAM constellation. The proposed ACZF/ACZF-SIC decoder for this code employs $L = 4$ subsets $\mathcal{I}_1 = \{1, 2, 3, 4\}$, $\mathcal{I}_2 = \{5, 6, 7, 8\}$, $\mathcal{I}_3 = \{9, 10, 11, 12\}$, $\mathcal{I}_4 = \{13, 14, 15, 16\}$. Let

/ 
10 
10 
10 



u = 



For $1 \leq k, \ell \leq 4$, the weight matrix of the $k$-th symbol in the $\ell$-th group, i.e., the weight matrix of the symbol $s_{4(\ell-1)+k}$, is $\mathbf{A}_{4(\ell-1)+k} = \mathbf{U}^{k-1}\mathbf{D}_\ell$, where $\mathbf{D}_1, \mathbf{D}_2, \mathbf{D}_3, \mathbf{D}_4$ are diagonal matrices that are specified in [20].

The proof of full diversity for ACZF/ACZF-SIC decoding of this code is similar to that of the $3 \times 3$ Perfect Code given in the previous subsection, and hence we do not reproduce it here. The complexity of this decoder is $M^{12}$, which achieves the lower bound of Section III-B. The complexity of ML decoding, however, is $M^{13.5}$ [21].

D. The 3x3 TAST Code 

The 3 antenna TAST code gives full diversity with ML decoding and has parameters $T = 3$ and $K = 9$. The information symbols are encoded using a QAM constellation. The design is
given in (TTOb at the top of the next page, where 7 = 
and the $3 \times 3$ matrix $\mathbf{M} = [m_{i,j}]$ is the real rotation matrix
from [25].

This code can be full-diversity ACZF/ACZF-SIC decoded using $L = 3$ subsets with $\mathcal{I}_1 = \{1, 2, 3\}$, $\mathcal{I}_2 = \{4, 5, 6\}$, $\mathcal{I}_3 = \{7, 8, 9\}$. The weight matrix of the $k$-th symbol in the $\ell$-th subset, i.e., that of the symbol $s_{3(\ell-1)+k}$,

[0 

U = 7 
7 



is U fc T>e, where 



and $\mathbf{D}_\ell$ is the matrix obtained by diagonalizing the $\ell$-th column of $\mathbf{M}$. The proof of full diversity is similar to that of the $3 \times 3$ Perfect code and hence is omitted. This code achieves the minimum ACZF/ACZF-SIC decoding complexity of $M^6$. On the other hand, the ML decoding complexity of this code is $M^7$ [14].

E. The Srinath-Rajan Code 

Among all known rate 2 codes that enable reduced complexity ML decoding (complexity less than $M^K$) for 4 transmit antenna, 2 receive antenna asymmetric MIMO systems, the Srinath-Rajan code has the best error performance and the least ML decoding complexity, $M^{4.5}$. For 4 and 16 QAM constellations this code provides full diversity with ML decoding. In this subsection we will show that this code is full-diversity ACZF/ACZF-SIC decodable with a complexity of $M^4$. This is a reduction by a factor of $M^{0.5}$ from its ML decoding complexity.

We now introduce some notation. For any two complex numbers $a, b$ let

$$\mathbf{A}(a, b) = \begin{bmatrix} a & b \\ -b^* & a^* \end{bmatrix}$$

be the Alamouti matrix embedding the complex numbers $a$ and $b$. Note that $\mathbf{A}(a, b)$ is a scaled unitary matrix for every non-zero vector $[a\ b]^T \in \mathbb{C}^2$, and $\mathbf{A}(a, b)$ equals the all-zero matrix if and only if $a = b = 0$. For any vector $\mathbf{u} \in \mathbb{C}^2$
we have ||A(a,&)u|| 2 = (|a| 2 + |&| 2 )||u|| 2 . Let 7 = eT, 
9 = i tan _1 (2), c = cos and s = sin (9. 
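The norm identity for Alamouti blocks, on which the proof below relies, can be verified numerically. This sketch assumes the standard embedding $\mathbf{A}(a, b) = \begin{bmatrix} a & b \\ -b^* & a^* \end{bmatrix}$; the test values are arbitrary.

```python
import numpy as np

def alamouti(a, b):
    """Alamouti matrix embedding a and b (standard form, assumed here)."""
    return np.array([[a, b], [-np.conj(b), np.conj(a)]])

a, b = 0.3 - 1.2j, 2.0 + 0.5j
u = np.array([1.0 + 1.0j, -0.7j])

A = alamouti(a, b)
energy = abs(a) ** 2 + abs(b) ** 2

# A^H A = (|a|^2 + |b|^2) I, so A is a scaled unitary matrix.
gram = A.conj().T @ A

# Consequently ||A u||^2 = (|a|^2 + |b|^2) ||u||^2 for every u.
lhs = np.linalg.norm(A @ u) ** 2
rhs = energy * np.linalg.norm(u) ** 2
print(np.allclose(gram, energy * np.eye(2)), np.isclose(lhs, rhs))
```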

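The parameters of the Srinath-Rajan code are $K = 8$ and $T = 4$, and its design is

$$\begin{bmatrix} c\mathbf{A}(s_1, s_2) & \gamma s\mathbf{A}(js_3, js_4) \\ \gamma c\mathbf{A}(s_3, s_4) & s\mathbf{A}(js_1, js_2) \end{bmatrix} + \begin{bmatrix} s\mathbf{A}(js_5, js_6) & \gamma c\mathbf{A}(s_7, s_8) \\ \gamma s\mathbf{A}(js_7, js_8) & c\mathbf{A}(s_5, s_6) \end{bmatrix}.$$

The proposed ACZF/ACZF-SIC decoder has $L = 2$ subsets with $\mathcal{I}_1 = \{1, 2, 3, 4\}$ and $\mathcal{I}_2 = \{5, 6, 7, 8\}$. Since $\lambda = N_t$ this decoder has the minimum achievable complexity $M^4$.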

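Proof of full-diversity: In order to use Theorem 2 we need to show that for any choice of $\mathbf{s}_{\mathcal{I}_1}, \mathbf{s}_{\mathcal{I}_2} \in \mathbb{C}^4 \setminus \{\mathbf{0}\}$, the matrix

$$\mathbf{X} = \begin{bmatrix} c\mathbf{A}(s_1, s_2) & \gamma s\mathbf{A}(js_3, js_4) \\ \gamma c\mathbf{A}(s_3, s_4) & s\mathbf{A}(js_1, js_2) \\ s\mathbf{A}(js_5, js_6) & \gamma c\mathbf{A}(s_7, s_8) \\ \gamma s\mathbf{A}(js_7, js_8) & c\mathbf{A}(s_5, s_6) \end{bmatrix}$$

has full column rank. We will prove this by contradiction.

Suppose $\mathbf{X}$ does not have full column rank. We will first show that none of the component Alamouti blocks is identically zero. Suppose $s_3 = s_4 = 0$. Since $\mathbf{s}_{\mathcal{I}_1} \neq \mathbf{0}$, at least one of $s_1$ or $s_2$ is non-zero. Thus $\mathbf{A}(s_1, s_2)$ and $\mathbf{A}(js_1, js_2)$ are scaled unitary matrices, while $\mathbf{A}(js_3, js_4)$ and $\mathbf{A}(s_3, s_4)$ are zero.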



[Equation (10): design matrix of the $3 \times 3$ TAST code, with entries formed from the symbols $s_1, \dots, s_9$, powers of $\gamma$ and the entries $m_{i,j}$ of $\mathbf{M}$; the individual entries are not legible in this copy.]

Thus the upper $4 \times 4$ submatrix of $\mathbf{X}$ is of full rank, and hence $\mathbf{X}$ has full column rank. Since this is a contradiction, at least one of $s_3, s_4$ is non-zero and hence $\mathbf{A}(js_3, js_4)$ and $\mathbf{A}(s_3, s_4)$ are non-zero matrices. Using similar arguments we can prove that all the Alamouti blocks in $\mathbf{X}$ are non-zero.

Let $\mathbf{h} = [\mathbf{h}_1^T\ \mathbf{h}_2^T]^T$, with $\mathbf{h}_1, \mathbf{h}_2 \in \mathbb{C}^2$, be a non-zero vector in the null-space of $\mathbf{X}$. Since the blocks have Alamouti structure, the first two columns and the last two columns of $\mathbf{X}$ are orthogonal pairs. Thus, the submatrix of $\mathbf{X}$ comprising the first two columns and the submatrix comprising the last two columns are both of rank 2. This implies that neither $\mathbf{h}_1$ nor $\mathbf{h}_2$ is a zero vector. Now consider the first two rows of $\mathbf{X}$. Since $\mathbf{h}$ is in the null-space we have

$$c\mathbf{A}(s_1, s_2)\mathbf{h}_1 + \gamma s\mathbf{A}(js_3, js_4)\mathbf{h}_2 = \mathbf{0}.$$

In particular, $\|c\mathbf{A}(s_1, s_2)\mathbf{h}_1\|^2 = \|\gamma s\mathbf{A}(js_3, js_4)\mathbf{h}_2\|^2$, i.e.,

$$c^2(|s_1|^2 + |s_2|^2)\|\mathbf{h}_1\|^2 = s^2(|s_3|^2 + |s_4|^2)\|\mathbf{h}_2\|^2. \qquad (11)$$

Using the same technique on the second two rows of $\mathbf{X}$ we have

$$c^2(|s_3|^2 + |s_4|^2)\|\mathbf{h}_1\|^2 = s^2(|s_1|^2 + |s_2|^2)\|\mathbf{h}_2\|^2. \qquad (12)$$

From (11) and (12) we have

$$\frac{\|\mathbf{h}_1\|}{\|\mathbf{h}_2\|} = \frac{s}{c}. \qquad (13)$$

Repeating this method on the last four rows of $\mathbf{X}$ we get

$$\frac{\|\mathbf{h}_1\|}{\|\mathbf{h}_2\|} = \frac{c}{s}. \qquad (14)$$

However, (13) and (14) imply that $c = s$, which is not true. Thus $\mathbf{X}$ has full column rank for any choice of $\mathbf{s}_{\mathcal{I}_1}, \mathbf{s}_{\mathcal{I}_2} \in \mathbb{C}^4 \setminus \{\mathbf{0}\}$. This completes the proof. ■

V. Sphere Decoding Implementation 



The sphere decoding algorithm [22] can be used to ML decode STBCs with low average complexity. This algorithm can be modified to implement both the ACZF and ACZF-SIC decoders. The ACZF-SIC decoder can be implemented with only a minor modification, and we explain this below.

We consider STBCs that are encoded by a square $M$-QAM constellation. In this case the real and imaginary parts of each complex symbol $s_i$ are encoded independently by a $\sqrt{M}$-ary PAM constellation. Thus we use a real sphere decoder to implement the ACZF-SIC decoder. When the constellation $\mathcal{A} \subset \mathbb{C}$ is arbitrary one can use the complex sphere decoder with the same modifications as explained below.



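Let $m = \arg\max_{\ell \in \{1,\dots,L\}} \det(G_{\mathcal{I}_\ell}^H G_{\mathcal{I}_\ell})$. Then we have 

$$ y = \sqrt{\mathrm{SNR}}\,[G_{\mathcal{I}_m} \ \ G_{\mathcal{I}_m^c}] \begin{bmatrix} s_{\mathcal{I}_m} \\ s_{\mathcal{I}_m^c} \end{bmatrix} + n. \tag{15} $$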



Represent $\sqrt{\mathrm{SNR}}\,[G_{\mathcal{I}_m} \ G_{\mathcal{I}_m^c}]$ by $G_0$ and $[s_{\mathcal{I}_m}^T \ s_{\mathcal{I}_m^c}^T]^T$ by $s_0$; 
then decoding s is equivalent to decoding $s_0$. For any complex 
vector u, let $\bar{u}$ be the real vector obtained from u by replacing 
every element $u_i$ of u by the tuple $[(u_i)_{\mathrm{Re}} \ (u_i)_{\mathrm{Im}}]^T$. Let 
$\bar{G}_0$ be the $2N_rT \times 2K$ real matrix obtained from the matrix 
$G_0 = [g_{i,j}]$ by replacing every element $g_{i,j}$ by the $2 \times 2$ 
matrix 

$$ \begin{bmatrix} (g_{i,j})_{\mathrm{Re}} & -(g_{i,j})_{\mathrm{Im}} \\ (g_{i,j})_{\mathrm{Im}} & (g_{i,j})_{\mathrm{Re}} \end{bmatrix}. $$

Then (15) is equivalent to the real system $\bar{y} = \bar{G}_0 \bar{s}_0 + \bar{n}$. The 
ACZF-SIC algorithm decodes the first $2\lambda$ symbols of $\bar{s}_0$ with 
ZF-SIC by conditioning on the remaining $2(K - \lambda)$ symbols. 
Note that the elements of $\bar{s}_0$ are encoded with a $\sqrt{M}$-ary PAM 
signal set with centroid at zero. With a suitable scaling and 
translation this system can be modified into 

$$ \tilde{y} = \tilde{G} x + \tilde{n}, $$

where the entries of the 2K dimensional real symbol vector x 
are encoded with the alphabet $\mathcal{Z} = \{0, 1, \dots, \sqrt{M} - 1\}$. 
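
The complex-to-real conversion and the scaling and translation step can be checked numerically. The sketch below is illustrative only: the helper names `real_vec` and `real_mat`, the matrix sizes and the particular affine map $s = 2x - (\sqrt{M} - 1)$ are assumptions made for this example, since the text only states that a suitable scaling and translation exists.

```python
import numpy as np

def real_vec(u):
    """Replace every complex entry u_i by the pair [Re(u_i), Im(u_i)]."""
    return np.column_stack([u.real, u.imag]).ravel()

def real_mat(G):
    """Replace every entry g by the 2x2 block [[Re g, -Im g], [Im g, Re g]]."""
    rows, cols = G.shape
    out = np.empty((2 * rows, 2 * cols))
    out[0::2, 0::2] = G.real
    out[0::2, 1::2] = -G.imag
    out[1::2, 0::2] = G.imag
    out[1::2, 1::2] = G.real
    return out

rng = np.random.default_rng(0)
n_rows, K, M = 4, 3, 16              # hypothetical sizes; 16-QAM, sqrt(M) = 4
q = int(np.sqrt(M))
pam = np.arange(-(q - 1), q, 2)      # zero-centroid PAM levels: -3, -1, 1, 3

G0 = rng.normal(size=(n_rows, K)) + 1j * rng.normal(size=(n_rows, K))
s0 = rng.choice(pam, K) + 1j * rng.choice(pam, K)
y = G0 @ s0                          # noiseless, so the identities hold exactly

G_bar, s_bar, y_bar = real_mat(G0), real_vec(s0), real_vec(y)

# Assumed affine map onto the alphabet {0, ..., sqrt(M)-1}: s = 2x - (q - 1).
x = (s_bar + (q - 1)) / 2
G_tilde = 2 * G_bar
y_tilde = y_bar + (q - 1) * G_bar @ np.ones(2 * K)
```

With this choice the translated observation satisfies `y_tilde == G_tilde @ x` up to rounding, and every entry of `x` is an integer in {0, 1, 2, 3}.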

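Let the QR decomposition of the matrix $\tilde{G}$ of the scaled system be 
$\tilde{G} = [Q \ Q'] \begin{bmatrix} R \\ 0 \end{bmatrix}$, where 
$R = [r_{i,j}]$ is a $2K \times 2K$ upper triangular matrix and Q is a 
$2N_rT \times 2K$ matrix. Searching for a lattice point within a 
squared distance of C from $\tilde{y}$ is equivalent to searching for a 
vector $x \in \mathbb{Z}^{2K}$ such that 

$$ \|y' - Rx\|^2 \le C', $$

where $y' = Q^T \tilde{y}$ and $C' = C - \|Q'^T \tilde{y}\|^2$. 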
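
The equivalence rests on the identity $\|\tilde{y} - \tilde{G}x\|^2 = \|Q^T\tilde{y} - Rx\|^2 + \|Q'^T\tilde{y}\|^2$. A small numerical check, with hypothetical sizes and random data chosen only for illustration, is:

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 8, 6                                   # hypothetical sizes (n > k)
G = rng.normal(size=(n, k))                   # stands in for the real channel matrix
y = rng.normal(size=n)
x = rng.integers(0, 4, size=k)                # any candidate symbol vector

# Full QR: Qfull is n x n, Rfull is n x k with zeros below its first k rows.
Qfull, Rfull = np.linalg.qr(G, mode="complete")
Q, Qp, R = Qfull[:, :k], Qfull[:, k:], Rfull[:k, :]

lhs = np.sum((y - G @ x) ** 2)
rhs = np.sum((Q.T @ y - R @ x) ** 2) + np.sum((Qp.T @ y) ** 2)
```

Here `lhs` and `rhs` agree up to floating-point error, so the term `np.sum((Qp.T @ y) ** 2)` is a constant that can be subtracted from the search radius.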

The sphere-decoding implementation of the ACZF-SIC decoder 
with Schnorr-Euchner enumeration [28] is given in Algorithm 2. 
This algorithm is a modification of Algorithm II 
of [27]. The variable i represents the current stage of the 
sphere decoder, $T_i$ is the accumulated Euclidean distance of 
the current lattice point from y' at stage i, and $\xi_i$ represents 
the interference faced by $x_i$ from the already detected symbols 
$x_{i+1}, \dots, x_{2K}$. The variable d is the square of the current 
search radius, and is initialized with the value of $\infty$. Steps 2 
and 6 together detect the symbols in the Schnorr-Euchner order 
for $i = 2\lambda + 1, \dots, 2K$. However, if $i \le 2\lambda$ the symbol $x_i$ is 
ZF-SIC decoded, i.e., it is set to the nearest element in $\mathcal{Z}$ 
after removing the interference from already detected symbols. 
If the symbol $x_i$ is within the search sphere and constellation 
boundaries, then Step 3 updates the values of $\xi_{i-1}$ and $T_{i-1}$, and 
moves to the next stage, i.e., the $(i-1)$th stage. If, however, 



Input: $y'$ and $R$ 
Output: Decoded symbol vector $\hat{x}$. 

%% Initialization 
Step 1. Set $i := 2K$, $T_{2K} := 0$, $\xi_{2K} := 0$, $d := \infty$. 

Step 2. if $i \le 2\lambda$ then 
            $x_i := \min\{\sqrt{M} - 1, \max\{0, \mathrm{rnd}((y'_i - \xi_i)/r_{i,i})\}\}$ 
        else 
            $x_i := \mathrm{rnd}((y'_i - \xi_i)/r_{i,i})$, $\Delta_i := \mathrm{sign}(y'_i - \xi_i - r_{i,i}x_i)$. 
        end 
        Go to Step 3. 

Step 3. if $d < T_i + |y'_i - \xi_i - r_{i,i}x_i|^2$ then 
            %% We are outside the sphere. 
            Go to Step 4. 
        else if $x_i \le -1$ or $x_i \ge \sqrt{M}$ then 
            %% We are outside the constellation boundaries. 
            Go to Step 6. 
        else 
            %% We are inside the sphere and 
            %% the constellation boundaries. 
            if $i > 1$ then 
                $\xi_{i-1} := \sum_{j=i}^{2K} r_{i-1,j}\,x_j$, 
                $T_{i-1} := T_i + |y'_i - \xi_i - r_{i,i}x_i|^2$, $i := i - 1$, go to Step 2. 
            else 
                %% i = 1, a valid lattice point is found. 
                Go to Step 5. 
            end 
        end 

Step 4. if $i = 2K$ then 
            Terminate. 
        else if $i \le 2\lambda$ then 
            $i := 2\lambda + 1$, go to Step 6. 
        else 
            $i := i + 1$, go to Step 6. 
        end 

Step 5. Set $d := T_1 + |y'_1 - \xi_1 - r_{1,1}x_1|^2$, $\hat{x} := x$, 
        $i := 2\lambda + 1$, go to Step 6. 

Step 6. %% Schnorr-Euchner enumeration. 
        $x_i := x_i + \Delta_i$, $\Delta_i := -\Delta_i - \mathrm{sign}(\Delta_i)$, go to Step 3. 

Algorithm 2: Sphere decoding implementation of the ACZF-SIC decoder. 
Fig. 1. Golden code, $N_r = 2$. 

$i = 1$ then the decoder has found a new lattice point within 
the search sphere and in Step 5 the value of $\hat{x}$ is updated, the 
new search radius is set to the distance of Rx from y', and 
the decoder goes to the $i = 2\lambda + 1$ stage. This is different from 
the usual sphere decoder where i is set to 2 after a valid point 
has been found. This difference arises because in ACZF-SIC 
the symbols $x_1, \dots, x_{2\lambda}$ are decoded by ZF-SIC, and hence 
require no Schnorr-Euchner enumeration. The case that the 
accumulated Euclidean distance from y' at the current stage 
exceeds $\sqrt{d}$ is handled in Step 4. In this case if the current 
stage is 2K then the program is terminated as no lattice point 
at a distance $\sqrt{d}$ or less can be found, else if $i > 2\lambda$ then the 
decoder goes to stage $i + 1$ for Schnorr-Euchner enumeration, 
and if $i \le 2\lambda$ it goes to stage $2\lambda + 1$ as the first $2\lambda$ symbols 
do not undergo enumeration. 
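
The control flow described above can be sketched in Python as follows. This is an illustrative sketch and not the authors' implementation: it uses 0-based stage indices, assumes that R has a positive diagonal (enforced here by the hypothetical helper `positive_qr`), and assumes sign(0) = +1 so that the zigzag step never stalls. The function `exhaustive_reference` is likewise an assumption added for checking: it tries every conditioning vector, applies ZF-SIC to the remaining symbols and keeps the closest candidate.

```python
import itertools
import numpy as np

def sgn(v):
    # Assumption: sign(0) is taken as +1 so the zigzag step is never zero.
    return 1 if v >= 0 else -1

def rnd(v):
    return int(np.floor(v + 0.5))

def aczf_sic_sphere(yp, R, n_sic, q):
    """yp = Q^T y; R upper triangular with positive diagonal;
    n_sic = 2*lambda ZF-SIC stages; symbols lie in {0, ..., q-1}."""
    n = R.shape[0]
    x = np.zeros(n, dtype=int)
    delta = np.zeros(n, dtype=int)
    T = np.zeros(n)            # T[i]: distance accumulated above stage i
    xi = np.zeros(n)           # xi[i]: interference seen by stage i
    best, d = None, np.inf
    i, step = n - 1, 2         # Step 1
    while True:
        if step == 2:          # first candidate at stage i
            c = (yp[i] - xi[i]) / R[i, i]
            if i < n_sic:
                x[i] = min(q - 1, max(0, rnd(c)))   # ZF-SIC: clamp to alphabet
            else:
                x[i] = rnd(c)
                delta[i] = sgn(c - x[i])
            step = 3
        elif step == 3:
            dist = T[i] + (yp[i] - xi[i] - R[i, i] * x[i]) ** 2
            if d < dist:
                step = 4                            # outside the sphere
            elif x[i] < 0 or x[i] > q - 1:
                step = 6                            # outside the constellation
            elif i > 0:
                xi[i - 1] = R[i - 1, i:] @ x[i:]
                T[i - 1] = dist
                i, step = i - 1, 2
            else:                                   # Step 5: valid point found
                d, best = dist, x.copy()
                i, step = n_sic, 6
        elif step == 4:
            if i == n - 1:
                return best, d
            i = n_sic if i < n_sic else i + 1
            step = 6
        else:                                       # Step 6: zigzag enumeration
            x[i] += delta[i]
            delta[i] = -delta[i] - sgn(delta[i])
            step = 3

def exhaustive_reference(yp, R, n_sic, q):
    """Try every conditioning vector, ZF-SIC the rest, keep the closest."""
    n = R.shape[0]
    best, d = None, np.inf
    for tail in itertools.product(range(q), repeat=n - n_sic):
        x = np.zeros(n, dtype=int)
        x[n_sic:] = tail
        for i in range(n_sic - 1, -1, -1):
            c = (yp[i] - R[i, i + 1:] @ x[i + 1:]) / R[i, i]
            x[i] = min(q - 1, max(0, rnd(c)))
        dist = np.sum((yp - R @ x) ** 2)
        if dist < d:
            best, d = x, dist
    return best, d

def positive_qr(G):
    """Reduced QR with the diagonal of R made positive."""
    Q, R = np.linalg.qr(G)
    s = np.sign(np.diag(R))
    return Q * s, s[:, None] * R
```

For a real matrix `G` and observation `y`, one would compute `Q, R = positive_qr(G)` and call `aczf_sic_sphere(Q.T @ y, R, n_sic, q)`. Barring exact ties in distance, the result should coincide with `exhaustive_reference`, since pruning only discards candidates whose partial distance already exceeds the best distance found so far.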

VI. Simulation Results 

The simulation results comparing the performance of the 
ACZF-SIC decoder with the ML decoder for the Golden code, 
3 antenna TAST code, 4 antenna Perfect code and the Srinath-Rajan 
code are given in Figs. 1-4, in that order. In all the cases 
it can be seen that ACZF-SIC performs identically to the ML 
decoder. The simulations provided in [15] show that the same 
holds true for the 3 antenna Perfect code with ACZF decoding. 

From Table I we see that for each of these five codes the 
ACZF-SIC complexity is strictly less than the ML decoding 
complexity. From the table we also see that the ACZF-SIC 
complexity of these codes is less than or equal to the 
complexity of the full-diversity codes with least known ML 
decoding complexity. Thus the decoding complexity of the 
best performing codes for 2, 3 and 4 antennas with low 
ML decoding complexity (complexity less than $M^K$) can 
be reduced further by using the ACZF-SIC decoder without 
trading off the error performance. 





Fig. 2. The $3 \times 3$ TAST code, $N_r = 3$. 

Fig. 3. The 4 antenna Perfect code, $N_r = 4$. 

Fig. 4. The Srinath-Rajan code, $N_r = 2$. 



VII. Conclusion 

In this paper, we have introduced two new low complexity 
decoding techniques for STBCs - the ACZF and ACZF-SIC 
decoders. We have given sufficient conditions for an STBC 
to give full-diversity with ACZF/ACZF-SIC decoding, and 
shown that the Golden code, 3 and 4 antenna Perfect codes, 
3 antenna TAST code and the Srinath-Rajan code can be 
full-diversity ACZF/ACZF-SIC decoded with complexity less than 
that of ML decoding. Simulations show that this advantage in 
decoding complexity comes with no loss in error performance 
with respect to ML decoding. These five codes along with 
ACZF/ACZF-SIC decoding outperform all known codes for 
$N_t \le 4$ in terms of complexity and error performance. The 
following problems are yet to be settled. 

• The reason for the essentially-ML performance of 
ACZF/ACZF-SIC decoders is to be investigated. Is there 
a criterion that ensures that a given non-ML decoding 
technique has error performance close to that of the ML 
decoder? 

• Constructing/identifying STBCs for $N_t > 4$ that allow 
full-diversity ACZF/ACZF-SIC decoding. For example, 
are all TAST codes [19] or all codes from Cyclic Division 
Algebras [29] full-diversity ACZF/ACZF-SIC decodable? 

Appendix A 
Proof of Theorem 1 

In order to prove the theorem we first derive a lower bound 
for $\det(G_{\mathcal{I}_m}^H G_{\mathcal{I}_m})$; we then use this result to derive an upper 
bound on the probability of decoding error. 

Assume that the hypothesis of the theorem is true. Let 
h = vec(H). For every $\ell = 1, \dots, L$ each entry of $G_{\mathcal{I}_\ell}$ is a 
complex linear combination of entries of h. Thus the function 
$f_\ell(h) = \det(G_{\mathcal{I}_\ell}^H G_{\mathcal{I}_\ell})$ is a polynomial in the entries of h and 
their complex conjugates, and hence is a continuous function. 
Note that m itself is a function of h, and 

$$ \det(G_{\mathcal{I}_m}^H G_{\mathcal{I}_m}) = f(h) = \max_{\ell \in \{1,\dots,L\}} f_\ell(h) $$

is also a continuous function since it is the maximum of 
L individual continuous functions. We will now follow an 
argument similar to [1], [2] to derive a lower bound on f. 
Consider the unit sphere S in $\mathbb{C}^{N_t N_r}$. From the hypothesis of 
the theorem, $f(h) > 0$ for every $h \in S$. Since S is a compact 
set and f is continuous, there exists a real number $c > 0$ such 
that $f(h) \ge c$ for any $h \in S$. Since each $G_{\mathcal{I}_\ell}$, $\ell = 1, \dots, L$, is 
a linear function of h it satisfies $G_{\mathcal{I}_\ell}(h) = \|h\|\, G_{\mathcal{I}_\ell}\!\left(\frac{h}{\|h\|}\right)$. 
Therefore $f_\ell(h) = \|h\|^{2\lambda} f_\ell\!\left(\frac{h}{\|h\|}\right)$ for all $\ell = 1, \dots, L$, 
and hence $f(h) = \|h\|^{2\lambda} f\!\left(\frac{h}{\|h\|}\right)$ for any $h \ne 0$. Since 
$\frac{h}{\|h\|} \in S$ we have $f\!\left(\frac{h}{\|h\|}\right) \ge c$. Thus, 

$$ \det(G_{\mathcal{I}_m}^H G_{\mathcal{I}_m}) = f(h) \ge c\,\|h\|^{2\lambda}. \tag{16} $$
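
The scaling step behind (16) can be illustrated numerically. In the sketch below the linear map from h to the matrix is built from random matrices `B[j]`, which are hypothetical stand-ins for the code's actual structure; only linearity in h matters for the check.

```python
import numpy as np

rng = np.random.default_rng(2)
rows, lam, dim = 6, 2, 4                      # hypothetical sizes
# Hypothetical linear map h -> G(h): column j of G(h) is B[j] @ h.
B = rng.normal(size=(lam, rows, dim)) + 1j * rng.normal(size=(lam, rows, dim))

def G_of(h):
    return np.stack([B[j] @ h for j in range(lam)], axis=1)   # rows x lam

def f(h):
    G = G_of(h)
    return np.linalg.det(G.conj().T @ G).real                 # det(G^H G)

h = rng.normal(size=dim) + 1j * rng.normal(size=dim)
norm = np.linalg.norm(h)
lhs = f(h)
rhs = norm ** (2 * lam) * f(h / norm)         # ||h||^(2*lambda) * f(h / ||h||)
```

The values `lhs` and `rhs` agree up to rounding, which is the homogeneity of degree $2\lambda$ used to extend the bound from the unit sphere to all non-zero h.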

Assume that the transmitted information vector is $b \in \mathcal{A}^K$. 
Let $\mathcal{E}_1$ be the event that the tuple $(b_{\mathcal{I}_m^c}, b_{\mathcal{I}_m})$ does not belong 
to the set of candidate vectors obtained at the end of the second step 
of the decoding process. Let $\mathcal{E}$ denote the event of the decoder 
deciding in favour of the wrong codeword. Then 

$$ P(\mathcal{E}) = P(\mathcal{E} \cap \mathcal{E}_1) + P(\mathcal{E} \cap \mathcal{E}_1^c) \le P(\mathcal{E}_1) + P(\mathcal{E} \mid \mathcal{E}_1^c). \tag{17} $$

We will now upper bound each of the two terms in the above 
expression to complete the proof. 

Since b is the transmitted vector we have 
$y = \sqrt{\mathrm{SNR}}\,(G_{\mathcal{I}_m^c} b_{\mathcal{I}_m^c} + G_{\mathcal{I}_m} b_{\mathcal{I}_m}) + n$. The vector 
$\hat{s}_{\mathcal{I}_m}(b_{\mathcal{I}_m^c})$, which belongs to this candidate set, is 
the output of the ZF decoder for the system 

$$ y(b_{\mathcal{I}_m^c}) = y - \sqrt{\mathrm{SNR}}\,G_{\mathcal{I}_m^c} b_{\mathcal{I}_m^c} = \sqrt{\mathrm{SNR}}\,G_{\mathcal{I}_m} b_{\mathcal{I}_m} + n. $$

This is a space-time block coded system with a ZF receiver 
whose equivalent channel matrix is $G_{\mathcal{I}_m}$. The full-diversity 
(i.e., order of $N_tN_r$ diversity) criterion for this system is given 
in Theorem 1 of [1], and it coincides with (16). Following the 
same steps as in the proof of Theorem 1 of [1] we can show 
that 

$$ P(\mathcal{E}_1) \le P\big(\hat{s}_{\mathcal{I}_m}(b_{\mathcal{I}_m^c}) \ne b_{\mathcal{I}_m}\big) \le c_1\,\mathrm{SNR}^{-N_tN_r}, \tag{18} $$

for some real number $c_1 > 0$. 

The term $P(\mathcal{E} \mid \mathcal{E}_1^c)$ is the probability that the minimum 
distance decoder does not decide in favour of b given that 
the information symbol vector $(b_{\mathcal{I}_m^c}, b_{\mathcal{I}_m})$ does belong to the 
candidate set. This step of the decoder is the same as that of ML decoding 
of $\mathcal{C}$ except that the search space of the ML decoder has been 
reduced from the entire codebook $\mathcal{C}$ of size $M^K$ to the set of 
$M^{K-\lambda}$ codewords corresponding to the information vectors 
in the candidate set. Since the transmitted codeword belongs to the reduced 
set of codewords, the probability that the wrong codeword is 
output at this step is upper bounded by the probability of error 
of the ML decoder of $\mathcal{C}$. Thus there exists a constant $c_2 > 0$ 
such that 

$$ P(\mathcal{E} \mid \mathcal{E}_1^c) \le c_2\,\mathrm{SNR}^{-rN_r}, \tag{19} $$

where $r = \min\{\mathrm{rank}(X - X') \mid X, X' \in \mathcal{C},\ X \ne X'\}$. 
Since $r \le N_t$, from (17), (18) and (19) we have 
$P(\mathcal{E}) \le (c_1 + c_2)\,\mathrm{SNR}^{-rN_r}$ at high signal to noise ratios. 
Thus the ACZF decoder achieves the same diversity order as 
the ML decoder. 

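Appendix B 
Proof of Theorem 2 

We will first prove that for a given $\ell = 1, \dots, L$, the matrix 
$G_{\mathcal{I}_\ell}$ has full column rank if and only if $X_{\mathcal{I}_\ell}(u)H \ne 0$ for 
any non-zero vector $u = [u_1 \ u_2 \ \cdots \ u_\lambda] \in \mathbb{C}^\lambda$. Let 
$\mathcal{I}_\ell = \{i_1, \dots, i_\lambda\}$, i.e., 

$$ G_{\mathcal{I}_\ell} = [\mathrm{vec}(A_{i_1}H) \ \ \mathrm{vec}(A_{i_2}H) \ \cdots \ \mathrm{vec}(A_{i_\lambda}H)]. $$

The columns of $G_{\mathcal{I}_\ell}$ are linearly independent if and only if the 
matrices $A_{i_1}H, \dots, A_{i_\lambda}H$ are linearly independent, i.e., the 
matrix $\sum_{j=1}^{\lambda} u_j A_{i_j}H \ne 0$ for any non-zero vector u. This is the 
same as $X_{\mathcal{I}_\ell}(u)H \ne 0$ for any non-zero vector u. 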
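
The identity underlying this argument, $G_{\mathcal{I}_\ell} u = \mathrm{vec}(X_{\mathcal{I}_\ell}(u)H)$, can be checked with a short sketch. The matrices `A[j]` below are random stand-ins for the weight matrices $A_{i_j}$ and the sizes are hypothetical; they are assumptions for illustration, not any particular code from the paper.

```python
import numpy as np

rng = np.random.default_rng(3)
T, Nt, Nr, lam = 2, 2, 2, 2                   # hypothetical sizes
A = rng.normal(size=(lam, T, Nt)) + 1j * rng.normal(size=(lam, T, Nt))
H = rng.normal(size=(Nt, Nr)) + 1j * rng.normal(size=(Nt, Nr))
u = rng.normal(size=lam) + 1j * rng.normal(size=lam)

def vec(Mx):
    """Stack the columns of a matrix into one vector."""
    return Mx.reshape(-1, order="F")

# Column j of G is vec(A_j H).
G = np.column_stack([vec(A[j] @ H) for j in range(lam)])
# X(u) = sum_j u_j A_j.
X_u = sum(u[j] * A[j] for j in range(lam))

lhs = G @ u
rhs = vec(X_u @ H)
```

Since `lhs` equals `rhs`, a non-zero u with `G @ u == 0` exists exactly when `X_u @ H` vanishes for that u, which is the equivalence stated above.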

Using the result from the previous paragraph we will now 
show that the criteria of Theorems 1 and 2 are equivalent. Let 
$\mathcal{E}_A$ denote the event that the criterion of Theorem 1 is satisfied, 
and let $\mathcal{E}_B$ be the event corresponding to Theorem 2. We need 
to prove that $\mathcal{E}_A$ implies $\mathcal{E}_B$ and vice-versa, or equivalently 
$\mathcal{E}_A^c$ implies $\mathcal{E}_B^c$ and vice-versa. The event $\mathcal{E}_A$ is that for every 
$H \ne 0$, at least one of the L matrices $\{G_{\mathcal{I}_\ell}\}$ is of full column 
rank. From the result in the previous paragraph this happens if 
and only if for any given $H \ne 0$ there exists an $\ell \in \{1, \dots, L\}$ 
such that $X_{\mathcal{I}_\ell}(u_\ell)H \ne 0$ for any $u_\ell \ne 0$. 

Suppose $\mathcal{E}_A^c$ is true. There exists an $H \ne 0$ and non-zero 
vectors $u_1, \dots, u_L$ such that $X_{\mathcal{I}_\ell}(u_\ell)H = 0$ for $\ell = 1, \dots, L$. 
Thus all $X_{\mathcal{I}_\ell}(u_\ell)$ have a common non-zero nullspace, and 
hence the matrix X of Theorem 2 does not have full column rank. 
Thus $\mathcal{E}_B^c$ is true. 

Now suppose $\mathcal{E}_B^c$ is true. This means there exist non-zero 
vectors $u_1, \dots, u_L$ such that X has a non-zero nullspace. 
Choose $H \ne 0$ to be any matrix with columns in the nullspace 
of X. Thus, $XH = 0$ and hence $X_{\mathcal{I}_\ell}(u_\ell)H = 0$ for each 
$\ell = 1, \dots, L$ for this choice of non-zero vectors $u_1, \dots, u_L$. 
This negates the event $\mathcal{E}_A$. This completes the proof. 

Appendix C 
Proof of Lemma 1 

The criteria of Theorems 1 and 2 are equivalent, thus it is 
enough to prove that the criterion of Theorem 1 implies 
full-diversity with ACZF-SIC decoding. The proof is similar to 
that of Theorem 1. The difference lies in upper bounding the 
term $P(\mathcal{E}_1)$, which itself is upper bounded by the probability 
of error of the ZF-SIC decoder for the system 

$$ y(b_{\mathcal{I}_m^c}) = \sqrt{\mathrm{SNR}}\,G_{\mathcal{I}_m} b_{\mathcal{I}_m} + n. $$

Let $\mathcal{I}_m = \{i_1, \dots, i_\lambda\}$ and let $\mathrm{SNR}_k$ and $\mathrm{SNR}'_k$ be the 
post-processing signal to noise ratios for the symbol $s_{i_k}$ for ZF 
and ZF-SIC decoders respectively. Further, let $P_k$ and $P'_k$ be 
the probability of error for $s_{i_k}$ with ZF and ZF-SIC decoder 
respectively. 

Suppose the criterion of Theorem 1 is satisfied. From the 
proof of Theorem 1 we have that $P_k \le c_k\,\mathrm{SNR}^{-N_tN_r}$ at high 
SNR, for some real number $c_k > 0$ for $k = 1, \dots, \lambda$. We 
will now show that $P'_k$ is of the order of $\mathrm{SNR}^{-N_tN_r}$ for 
$k = 1, \dots, \lambda$. Given this result on $P'_k$ the rest of the proof is similar 
to that of Theorem 1. 

Now let the ZF-SIC decoder decode the symbols in the 
order $i_\lambda, i_{\lambda-1}, \dots, i_1$; then we have $\mathrm{SNR}_\lambda = \mathrm{SNR}'_\lambda$ for 
every channel H. Hence $P'_\lambda = P_\lambda = O(\mathrm{SNR}^{-N_tN_r})$. Now 
consider the ZF-SIC detection of the symbol $s_{i_{\lambda-1}}$. If the 
symbol $s_{i_\lambda}$ is decoded correctly the number of interferers 
faced by $s_{i_{\lambda-1}}$ is one less in the ZF-SIC receiver than in the 
ZF receiver. In that case the post-processing signal to noise 
ratio $\mathrm{SNR}'_{\lambda-1} \ge \mathrm{SNR}_{\lambda-1}$ for any channel H. Therefore if 
$s_{i_\lambda}$ were decoded correctly by the ZF-SIC receiver then 
$P'_{\lambda-1} \le P_{\lambda-1} = O(\mathrm{SNR}^{-N_tN_r})$. Now the overall 
probability of error for $s_{i_{\lambda-1}}$ is upper bounded by the sum of 
the probability of error for $s_{i_\lambda}$ and the probability of detecting 
$s_{i_{\lambda-1}}$ wrongly given that $s_{i_\lambda}$ was detected correctly. Since 
both these terms are of the order of $\mathrm{SNR}^{-N_tN_r}$ we have that 
$P'_{\lambda-1} = O(\mathrm{SNR}^{-N_tN_r})$. Using a similar argument we can 
prove that $P'_k = O(\mathrm{SNR}^{-N_tN_r})$ for all $k = 1, \dots, \lambda$. 
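
The comparison of post-processing signal to noise ratios can be illustrated with a short numerical sketch. It uses two standard expressions that are stated here as assumptions of the example and are not taken from the text: with unit noise variance, the ZF gain of symbol k is $1/[(G^HG)^{-1}]_{k,k}$, and the ZF-SIC gain with perfect cancellation, detecting the last column first, is $|r_{k,k}|^2$ from the QR decomposition of G. The random matrix `G` stands in for $G_{\mathcal{I}_m}$.

```python
import numpy as np

rng = np.random.default_rng(4)
rows, lam = 6, 3                               # hypothetical sizes
G = rng.normal(size=(rows, lam)) + 1j * rng.normal(size=(rows, lam))

# ZF: every symbol suffers interference nulling against all other columns.
zf_gain = 1.0 / np.real(np.diag(np.linalg.inv(G.conj().T @ G)))

# ZF-SIC (last column detected first, earlier decisions assumed correct):
# symbol k only needs to null columns 1..k-1, giving gain |r_kk|^2.
_, R = np.linalg.qr(G)
sic_gain = np.abs(np.diag(R)) ** 2
```

In this sketch the last entries of `zf_gain` and `sic_gain` coincide, matching $\mathrm{SNR}_\lambda = \mathrm{SNR}'_\lambda$, and every other entry of `sic_gain` is at least as large as the corresponding entry of `zf_gain`.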